Method and apparatus for detecting communication errors on a bus

ABSTRACT

A semiconductor memory includes multi-mode reporting signals, a state register, and parity detectors. The parity detector determines whether signals received on a communication bus contain a desired parity. The multi-mode reporting signals enable reporting of communication faults without adding additional signals to the semiconductor memory by being configured in a normal operating mode or a parity fault mode for reporting communication faults to an external memory controller. The state register enables storing of received values from the communication bus. With the state register, a memory controller may determine correctly received signal patterns and failing signal patterns. Parity may be defined as even or odd and may be generated based on various signal configurations. The embodiments may be configured as a computing system comprising a processor, an input device, an output device, the memory controller, and at least one semiconductor memory.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 13/311,808, filed Dec. 6, 2011, now U.S. Pat. No. 8,296,639, issued Oct. 23, 2012, which is a continuation of U.S. patent application Ser. No. 12/823,347, filed Jun. 25, 2010, now U.S. Pat. No. 8,074,159, issued Dec. 6, 2011, which is a continuation of U.S. patent application Ser. No. 11/186,713, filed Jul. 21, 2005, now U.S. Pat. No. 7,747,933, issued Jun. 29, 2010, the disclosure of each of which is hereby incorporated herein by this reference in its entirety.

BACKGROUND

1. Field

Embodiments of the present disclosure relate generally to semiconductor integrated circuits and, in particular, to error detection and remedial measures in the context of integrated circuits transmitting and receiving multi-bit address and data information.

2. Description of Related Art

Encoders and Decoders for detection and correction of data errors have long been used in integrated circuits, particularly in Dynamic Random Access Memories (DRAMs), which may be susceptible to data storage errors. Methods of parity generation, storage, and checking have often been implemented in an attempt to discover where and when storage errors occur. Generally, parity is defined as the calculation of a number of asserted signals, or bits, in a collection of signals generally referred to as a bus. Typically, a “1” is considered the asserted state. In a characteristic application, a data byte containing 8 bits may be used as the base collection. As an example, if the data byte has the value “1100 1011” five bits contain the value of “1” and three bits contain the value of “0.” To track the parity of a data byte, an additional bit may be added to the byte to indicate the parity of the byte. In this case, if odd parity is desired, the parity bit is placed in the appropriate state to make the total number of asserted bits in the collection of bits including the data byte and the parity bit an odd number. Therefore, for the case of five asserted bits in the data byte, the parity bit is de-asserted to keep the total number of asserted bits odd. If, as another example, the data byte contains two asserted bits, the odd parity bit is asserted to make the total number of asserted bits in the combination of the data byte and the parity bit an odd number, namely three in this case. Parity may also be generated and checked as even parity. In even parity, the parity bit is asserted or de-asserted to make the total number of asserted bits in the collection of the data byte and parity bit equal to an even number.

In many conventional memory systems containing parity for the detection of storage errors, the additional parity bit is stored in memory along with the data byte requiring 9 bits of memory storage for each byte of data. With this extra storage bit, if the data byte and parity are stored with odd parity, when the read occurs a check is performed to verify that odd parity is present on the read data. If not, then an error has occurred in either storage or retrieval of the data.

Additionally, systems have been developed to check that an address, or other signals, communicated from a transmitter to a receiver are received correctly. In the case of address signals, detecting and possibly attempting to correct address errors is important to prevent data from being read or written to the wrong storage location. In these address fault detection systems error detection is desired for the transmission of signals, not storage. Therefore, there is no need to store the parity bit(s). Instead valid parity is generated at the transmission end, the parity and data signals are transmitted, and a check is performed to ensure that valid parity is still present at the receiving end. In addition, using additional bits beyond the parity bit, error correction codes can be combined with the parity bit. The error correction codes accompany the transmission of data and parity, allowing correction of certain errors at the receiving end that may occur in transmission. One approach to dealing with this problem of signal transmission errors and correction techniques is seen in U.S. Pat. No. 5,173,905 to Parkinson et al.

As the need for higher speed and bandwidth to memory increases, engineers push closer to the speed and signaling boundaries where transmission errors may occur. Signals between modern semiconductor devices may have very low voltage swings or may be configured as current mode signals. The smaller voltage swings reduce the acceptable margin of error even with more precise input signal level sensors. Also, pushing the signal transmissions to higher speeds means that a shorter time period exists when the signal is in a steady state of a high or a low when it can be sensed before the signal makes a transition to the next state. Computer graphics controllers and the graphics DRAMs used in graphics memory systems are particularly high consumers of memory bandwidth and therefore vulnerable to signal transmission errors.

As a result, there is a need for simple low cost detection of signal transmission and reception errors on high speed buses, particularly graphics buses, to allow for remedial measures to be taken. Additionally, there is a need to perform this operation without adding additional Input/Output (IO) signals to devices already under severe signal count constraints.

SUMMARY

One embodiment of the invention comprises a semiconductor memory comprising at least one multi-mode reporting signal, a state register, and a parity detector for determining if a set of signals received on a communication bus contains a desired parity. The multi-mode reporting signal enables reporting of communication faults without adding additional signals to the semiconductor memory. The multi-mode reporting signal may be configured in a normal operational mode, or it may be configured in a fault reporting mode for signaling the communication fault to an external device. An external device, such as a memory controller or graphics memory controller, may place the multi-mode reporting signal in the fault reporting mode by writing to an enable unit within the semiconductor memory.

Additionally, the state register enables the storing of received values from the communication bus. This storing of received values allows an external memory controller to read the state register to determine which signals were received incorrectly. With the state register, an external device may determine what type of signal patterns are received correctly and what type of signal patterns may fail. In this error detection system, parity may be defined as even or odd. Also, parity may be generated based on various signal collections depending upon the type of signals present on the communication bus and the desired data patterns to be placed on the bus.

Another embodiment of the invention comprises a system including a memory controller and at least one semiconductor memory. In this embodiment, the memory controller may generate the proper parity for the set of signals on the communication bus and transmit the signals and parity to the at least one semiconductor memory. The at least one semiconductor memory then checks for expected parity and stores the received signal values in the state register. If a parity fault is detected and the at least one semiconductor memory is configured to report the fault, the fault is indicated to the memory controller. The memory controller may then read the state register in the at least one semiconductor memory to determine which signal was not received correctly. As a result of the determination, the memory controller may then attempt to modify various transmission characteristics in an attempt to remedy the faulty communication.

Yet another embodiment of the invention comprises a computing system including a processor, at least one input device, at least one output device, the memory controller, and at least one semiconductor memory. In this embodiment, the memory controller may physically be separate from the processor or may be on the same semiconductor device as the processor.

BRIEF DESCRIPTION OF THE DRAWINGS

In the drawings, which illustrate what is currently considered to be the best mode for carrying out the invention:

FIG. 1 is a block diagram of a system comprising a memory controller and a semiconductor memory connected by a communication bus;

FIG. 2 is a block diagram showing details of the communication bus in an exemplary implementation using a graphics DRAM; and

FIG. 3 is a block diagram showing semiconductor memories and a memory controller in a computer system.

DETAILED DESCRIPTION

In one embodiment of the invention, depicted in FIG. 1, a memory controller 300 is connected to a semiconductor memory 100 by a communication bus 200. The semiconductor memory 100 contains a conventional memory array 190 with all the associated addressing logic, reading logic, and writing logic required to access the memory array 190. In addition, the semiconductor memory 100 contains modules for detecting, storing, and reporting communication faults.

For detecting the communication faults, a parity detector 110 is connected to the communication bus 200. A parity fault may be reported if the received parity does not match a desired parity. The desired parity may be configured as even or odd parity. As described previously, parity is defined as the calculation of a number of asserted signals, or bits, in a collection of signals generally referred to as a bus. In a characteristic application, a data byte containing 8 bits may be used as the base collection. As previously noted, if the data byte has the value “1100 1011” five bits contain the value of “1” and three bits contain the value of “0.” For odd parity, the parity bit is de-asserted to make the total number of asserted signal in the combination of the data byte and parity bit equal to an odd number. On the other hand, for even parity, the parity bit is asserted to make the total number of asserted signal in the combination of the data byte and parity bit equal to an even number. This is a simple example of a typical parity implementation for a single data byte. The present invention comprises many more signals in more flexible parity arrangements.

FIG. 2 shows an exemplary embodiment of the invention using a graphics memory controller 300′ and graphics DRAM 100′. Typical address, data, and control signals are shown for a 256 Megabit graphics synchronous DRAM 100′ with a 32 bit wide data bus. When the semiconductor memory 100 is placed into the parity detection mode, various signal partitioning is possible and various signals may be used as the parity bit. For example, in a straightforward implementation, the signals may be logically segmented in to a data portion 210, an address portion 220, a control portion 230, and a parity portion defined as a logical collection of all the parity bits for all the defined portions.

In this straightforward partitioning, it may be desirable to separate each of the four data bytes (211, 212, 213, and 214) into separate parity checking collections. The parity bit associated with each data byte (211, 212, 213, and 214), while in the parity checking mode may be, for example, the write enable signals 218 for each byte, denoted in FIG. 2 as Write Data Strobes (WDQS0-3). Additionally, the address portion 220 may be defined as the address signals 222 denoted as A0-A11. Any one of the address signals 222 may be selected as the address parity bit 228. In this embodiment, A11 is selected as the address parity bit 228. Finally, the control portion 230 may be defined as any additional signals required for control of the memory device. A non-exhaustive list of these type of signals may be signals typical of any DRAM or graphics DRAM 100′ well known to those skilled in DRAM design such as; Row Address Strobe (RAS), Column Address Strobe (CAS), Write Enable (WE), Chip Select (CS#), Clock Enable (CKE#), input Data Masks (DM0-3), and Bank Addresses (BA0-1). In a control portion 230 such as this, any signal may be chosen as the control parity bit 238. For the implementation shown in FIG. 2, RAS is selected as the control parity bit 238.

Many other collections are contemplated within the scope of the invention. For example, the data bytes (211, 212, 213, and 214) may be organized into 16 bit words with one parity bit. In this configuration, as an example, WDQS0 may be associated with the 16 bit word containing data byte zero 211 and data byte one 212. WDQS2 may be associated with the 16 bit word containing data byte two 213 and data byte three 214. In another configuration for the data portion, the entire data bus may be configured with a single data parity bit such as WDQS0. Similarly, the address portion 220 may contain additional signals such as the bank addresses BA0-1. In this address portion 220 configuration it may be desirable to designate BA0 as one address parity bit 228 and BA1 as an additional address parity bit 228. The address bus may then be split into two portions, such as A0-A5 as one portion with BA0 as a first address parity bit 228 and A6-A11 as the other portion with BA1 as a second address parity bit 228. Yet another configuration may move the Data Mask signals DM0-3, from the control portion 230 to the data portion 210 either as data bits or possibly as data parity bits. It will be clear to a person skilled in the art that many different combinations are possible. Additionally, allowing configuration in different modes is desirable for flexibility in analysis of communication faults.

Reporting the communication fault is performed by at least one multi-mode reporting signal 240. In the exemplary embodiment shown in FIG. 2, the multi-mode reporting signals 240 are implemented as the Read Data Strobes (RDQS0-3) on a graphics DRAM 100′. Implementing a plurality of multi-mode reporting signals 240 allows multiple parity errors to be reported for different signal portions such as the address portion 220, control portion 230, and data portion 210. Segmenting the parity faults this way may assist the memory controller 300 in determining where the communication fault exists. Clearly, a single multi-mode reporting signal 240 is also possible to report a communication fault anywhere on the communication bus 200.

Communication fault reporting may be disabled. When communication fault reporting is disabled, by writing to a control register in the graphics DRAM 100′, the multi-mode reporting signal 240 is configured to perform its normal operational function. Additionally, when the system is configured to operate in a normal, non-parity mode, the address, data, and control signals designated as parity bits may be configured, in the memory controller 300, to perform normal operational functions rather than performing the parity bit function.

However, if detection of communication faults is desired, the multi-mode reporting signal 240 may be placed in a fault reporting mode whereby the multi-mode reporting signal 240 is asserted whenever a communication event contains a parity error. As long as communication events are received with proper parity, the multi-mode reporting signal 240 will remain de-asserted. The assertion level of the multi-mode reporting signal 240 in the fault reporting mode may be defined as high or low depending on the system application and requirements of the memory controller 300.

To track where and when a communication fault occurs, a state register 120 (FIG. 1) stores the values of the set of signals on the communication bus 200 for each communication event. The state register 120 may be enabled, by the enable unit 130, to begin collecting communication events independent of whether communication faults are reported on the multi-mode reporting signal 240. When enabled, the state register 120 reloads the state of the communication bus 200 for each communication event until the parity detector 110 detects a communication fault. At the point where a communication fault is detected, storage of further communication events is disabled so that the state register 120 contains the signal values for the faulty communication event. A memory controller 300 may then read the state register 120 to determine which signal was not received correctly. The state register 120 may then be re-armed by the enable unit 130 to collect additional communication events.

If communication event errors are detected, they may be reported to a memory controller 300. The memory controller 300, as shown in FIG. 1, comprises a parity generator 310, a transmitter 320, a fault receiver 330, and a remediation unit 340. When in a mode of checking for communication errors, the parity generator 310 creates proper parity, either even or odd, for the communication bus 200 using the desired partitioning described above. The transmitter 320 sends the data and parity signals on the communication bus 200. When communication faults are detected by the semiconductor memory 100, they may be reported to the fault receiver 330 on the multi-mode reporting signal(s) 240.

If desired, the system comprising a memory controller 300 and semiconductor memory 100 may be configured to attempt remedial measures for repairing communication errors by modifying various transmission characteristics of the communication bus 200. Memory controllers 300 and semiconductor memories 100 typically contain components for modifying the impedance levels of output drivers. Adjusting these impedance levels may help reduce signaling problems such as ringing and overshoot. Some memory buses are configured with current mode outputs. In these systems, in addition to adjusting output impedance, the communication bus 200 may have termination resistors on the signals of the communication bus 200. Adjusting the value of these resistors may reduce signaling problems. Input pins are often configured to sense the switch from a high to low, or low to high, at a specific voltage level supplied by the system. Adjusting this voltage level may reduce communication errors.

Finally, various timing adjustments are possible, such as when various outputs are triggered to switch state. For example, the switching of a plurality of outputs may be staggered such that not all the outputs of the plurality switch at the same time. Also, the various signal types (e.g., data, control, address) may be varied slightly in when they switch relative to each other to assist in timing issues such as input setup and hold problems. Other timing relationships and methods to modify signal transmission characteristics are also within the scope of the present invention.

Another embodiment of the invention, as shown in FIG. 3, comprises a computer system 500 comprising a processing module 510, at least one input device 520 and at least one output device 530. The processing module 510 comprises a processor 515, a memory controller 300, and at least one semiconductor memory 100 containing the communication fault detection apparatus according to the present invention. In this system, the memory controller 300 may be a standard memory controller 300 or a graphics DRAM controller 300′. Additionally, the memory controller 300 may be configured such that it is physically located within the processor 515 (not shown).

Although this invention has been described with reference to particular embodiments, the invention is not limited to these described embodiments. Rather, the invention is limited only by the appended claims, which include within their scope all equivalent apparatuses and methods that operate according to the principles of the invention as described. 

What is claimed is:
 1. A method of adjusting a communication bus, comprising: receiving information on the communication bus; determining a parity of the received information; determining a communication fault responsive to a mismatch between the parity and a desired parity; modifying a signal characteristic of one or more signals on the communication bus responsive to the mismatch; and changing a multi-mode reporting signal from an operational mode during the receiving information to a fault reporting mode responsive to the mismatch.
 2. The method of claim 1, wherein the modifying the signal characteristic comprises modifying an impedance of a driver operably coupled to the one or more signals.
 3. The method of claim 1, wherein the modifying the signal characteristic comprises modifying a termination resistance operably coupled to the one or more signals.
 4. The method of claim 1, wherein the modifying the signal characteristic comprises modifying a voltage reference level for inputs on the communication bus.
 5. The method of claim 1, wherein the modifying the signal characteristic comprises modifying a timing delay in receiving the communication bus responsive to the mismatch.
 6. The method of claim 1, wherein the modifying the signal characteristic comprises modifying a timing relationship between at least two different signals of the communication bus responsive to the mismatch.
 7. The method of claim 6, wherein the modifying the timing relationship between the at least two different signals further comprises modifying at least one of: a timing relationship between a data portion of the communication bus and a control portion of the communication bus; a timing relationship between the data portion and an address portion of the communication bus; and a timing relationship between the address portion and the control portion.
 8. The method of claim 1, further comprising placing the multi-mode reporting signal in the operational mode to disable the reporting of the communication fault.
 9. A semiconductor memory, comprising: a communication bus including one or more signals with a capability to modify a signal characteristic of the one or more signals responsive to a faulty communication event; a parity detector configured to determine the faulty communication event responsive to a comparison of a received parity to a desired parity; and a multi-mode reporting signal configured in a fault reporting mode responsive to the faulty communication event and in an operational mode to perform a normal operational function as part of the communication bus when not in the fault reporting mode.
 10. The semiconductor memory of claim 9, wherein the one or more signals include a capability for modifying a driver impedance of at least one driver operably coupled to at least one of the one or more signals.
 11. The semiconductor memory of claim 9, wherein the one or more signals include a capability for modifying a termination resistance of at least one receiver operably coupled to at least one of the one or more signals.
 12. The semiconductor memory of claim 9, wherein the one or more signals include a capability for modifying a voltage reference level of at least one receiver operably coupled to at least one of the one or more signals.
 13. The semiconductor memory of claim 9, wherein the one or more signals include at least two signals with receivers operably coupled to the at least two signals wherein the receivers include capabilities for modifying a timing delay between the at least two signals.
 14. The semiconductor memory of claim 9, wherein the one or more signals include at least two signals with drivers operably coupled to the at least two signals wherein the drivers include capabilities for modifying a timing delay between the at least two signals.
 15. The semiconductor memory of claim 9, further comprising a state register configured to store a state of the communication bus at a time of the faulty communication event for subsequent analysis of the state of the communication bus during the faulty communication event.
 16. The semiconductor memory of claim 9, further comprising an enable unit configured to enable indication of the faulty communication event by placing the multi-mode reporting signal in the fault reporting mode and disable the indication of the faulty communication event by placing the multi-mode reporting signal in the operational mode.
 17. A memory controller, comprising: a communication bus including one or more signals with a capability to modify a signal characteristic of the one or more signals responsive to a faulty communication event; a parity generator configured to generate a transmission parity for the communication bus; a transmitter configured to transmit the transmission parity with the one or more signals on the communication bus; and a remediation unit configured to attempt to eliminate a communication fault reported on a reporting signal of the communication bus by modifying the signal characteristic of the one or more signals.
 18. The memory controller of claim 17, wherein the signal characteristic comprises an output impedance of at least one of the one or more signals.
 19. The memory controller of claim 17, wherein the signal characteristic comprises a timing delay for at least one of the one or more signals.
 20. The memory controller of claim 17, wherein the signal characteristic comprises a timing relationship between at least two different signals of the one or more signals.
 21. The memory controller of claim 20, wherein the timing relationship between the at least two different signals is selected from of the group consisting of: a timing relationship between a data portion of the communication bus and a control portion of the communication bus; a timing relationship between the data portion and an address portion of the communication bus; and a timing relationship between the address portion and the control portion. 